Maximally selected chi-square statistics and binary splits of nominal variables.

نویسنده

  • Anne-Laure Boulesteix
چکیده

We address the problem of maximally selected chi-square statistics in the case of a binary Y variable and a nominal X variable with several categories. The distribution of the maximally selected chi-square statistic has already been derived when the best cutpoint is chosen from a continuous or an ordinal X, but not when the best split is chosen from a nominal X. In this paper, we derive the exact distribution of the maximally selected chi-square statistic in this case using a combinatorial approach. Applications of the derived distribution to variable selection and hypothesis testing are discussed based on simulations. As an illustration, our method is applied to a birth data set.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Maximally selected chi-square statistics for ordinal variables.

The association between a binary variable Y and a variable X having an at least ordinal measurement scale might be examined by selecting a cutpoint in the range of X and then performing an association test for the obtained 2 x 2 contingency table using the chi-square statistic. The distribution of the maximally selected chi-square statistic (i.e. the maximal chi-square statistic over all possib...

متن کامل

Maximally selected chi-square statistics for at least ordinal scaled variables

The association between a binary variable Y and a variableX with an at least ordinal measurement scale might be examined by selecting a cutpoint in the range of X and then performing an association test for the obtained 2 × 2 contingency table using the χ2 statistic. The distribution of the maximally selected χ2 statistic (i.e. the maximal χ2 statistic over all possible cutpoints) under the nul...

متن کامل

Maximally selected chi-square statistics and non-monotonic associations: an exact approach based on two cutpoints

Binary outcomes that depend on an ordinal predictor in a non-monotonic way are common in medical data analysis. Such patterns can be addressed in terms of cutpoints: for example, one looks for two cutpoints that define an interval in the range of the ordinal predictor for which the probability of a positive outcome is particularly high (or low). A chi-square test may then be performed to compar...

متن کامل

Maximally selected Chi-squared statistics and non-monotonic associations: An exact approach based on two cutpoints

Binary outcomes that depend on an ordinal predictor in a non-monotonic way are common in medical data analysis. Such patterns can be addressed in terms of cutpoints: for example, one looks for two cutpoints that define an interval in the range of the ordinal predictor for which the probability of a positive outcome is particularly high (or low). A Chi-squared test may then be performed to compa...

متن کامل

سری آمار: تحلیل جداول توافقی 1 (آزمون‌های کای‌دو)

Assessing of outcomes and risk factors in the form of qualitative variables is common in the most of medical studies and the research objectives are defined as the relationship between these variables. This paper introduces the concepts and basic and applied statistical tests to examine the relationship between these variables in these studies, including chi-square tests. Principles and...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Biometrical journal. Biometrische Zeitschrift

دوره 48 5  شماره 

صفحات  -

تاریخ انتشار 2006